Spectrum structures and biological functions of 8-mers in the human genome.
Identifieur interne : 000377 ( Main/Exploration ); précédent : 000376; suivant : 000378Spectrum structures and biological functions of 8-mers in the human genome.
Auteurs : Yun Jia [République populaire de Chine] ; Hong Li [République populaire de Chine] ; Jingfeng Wang [République populaire de Chine] ; Hu Meng [République populaire de Chine] ; Zhenhua Yang [République populaire de Chine]Source :
- Genomics [ 1089-8646 ] ; 2019.
Descripteurs français
- KwdFr :
- MESH :
- métabolisme : Nucléosomes.
- Analyse de séquence d'ADN, Biologie informatique, Génome humain, Humains, Ilots CpG, Motifs nucléotidiques.
English descriptors
- KwdEn :
- MESH :
- chemical , metabolism : Nucleosomes.
- Computational Biology, CpG Islands, Genome, Human, Humans, Nucleotide Motifs, Sequence Analysis, DNA.
Abstract
The spectra of k-mer frequencies can reveal the structures and evolution of genome sequences. We confirmed that the trimodal spectrum of 8-mers in human genome sequences is distinguished only by CG2, CG1 and CG0 8-mer sets, containing 2,1 or 0 CpG, respectively. This phenomenon is called independent selection law. The three types of CG 8-mers were considered as different functional elements. We conjectured that (1) nucleosome binding motifs are mainly characterized by CG1 8-mers and (2) the core structural units of CpG island sequences are predominantly characterized by CG2 8-mers. To validate our conjectures, nucleosome occupied sequences and CGI sequences were extracted, then the sequence parameters were constructed through the information of the three CG 8-mer sets respectively. ROC analysis showed that CG1 8-mers are more preference in nucleosome occupied segments (AUC > 0.7) and CG2 8-mers are more preference in CGI sequences (AUC > 0.99). This validates our conjecture in principle.
DOI: 10.1016/j.ygeno.2018.03.006
PubMed: 29522801
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: 000975
- to stream PubMed, to step Curation: 000975
- to stream PubMed, to step Checkpoint: 000377
- to stream Ncbi, to step Merge: 001D57
- to stream Ncbi, to step Curation: 001D57
- to stream Ncbi, to step Checkpoint: 001D57
- to stream Main, to step Merge: 000380
- to stream Main, to step Curation: 000377
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Spectrum structures and biological functions of 8-mers in the human genome.</title>
<author><name sortKey="Jia, Yun" sort="Jia, Yun" uniqKey="Jia Y" first="Yun" last="Jia">Yun Jia</name>
<affiliation wicri:level="1"><nlm:affiliation>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China; College of Science, Inner Mongolia University of Technology, Hohhot 010051, China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China; College of Science, Inner Mongolia University of Technology, Hohhot 010051</wicri:regionArea>
<wicri:noRegion>Hohhot 010051</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Li, Hong" sort="Li, Hong" uniqKey="Li H" first="Hong" last="Li">Hong Li</name>
<affiliation wicri:level="1"><nlm:affiliation>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China. Electronic address: ndlihong@imu.edu.cn.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021</wicri:regionArea>
<wicri:noRegion>Hohhot 010021</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Wang, Jingfeng" sort="Wang, Jingfeng" uniqKey="Wang J" first="Jingfeng" last="Wang">Jingfeng Wang</name>
<affiliation wicri:level="1"><nlm:affiliation>College of Science, Inner Mongolia University of Technology, Hohhot 010051, China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>College of Science, Inner Mongolia University of Technology, Hohhot 010051</wicri:regionArea>
<wicri:noRegion>Hohhot 010051</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Meng, Hu" sort="Meng, Hu" uniqKey="Meng H" first="Hu" last="Meng">Hu Meng</name>
<affiliation wicri:level="1"><nlm:affiliation>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021</wicri:regionArea>
<wicri:noRegion>Hohhot 010021</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Yang, Zhenhua" sort="Yang, Zhenhua" uniqKey="Yang Z" first="Zhenhua" last="Yang">Zhenhua Yang</name>
<affiliation wicri:level="1"><nlm:affiliation>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021</wicri:regionArea>
<wicri:noRegion>Hohhot 010021</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2019">2019</date>
<idno type="RBID">pubmed:29522801</idno>
<idno type="pmid">29522801</idno>
<idno type="doi">10.1016/j.ygeno.2018.03.006</idno>
<idno type="wicri:Area/PubMed/Corpus">000975</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">000975</idno>
<idno type="wicri:Area/PubMed/Curation">000975</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">000975</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000377</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">000377</idno>
<idno type="wicri:Area/Ncbi/Merge">001D57</idno>
<idno type="wicri:Area/Ncbi/Curation">001D57</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">001D57</idno>
<idno type="wicri:Area/Main/Merge">000380</idno>
<idno type="wicri:Area/Main/Curation">000377</idno>
<idno type="wicri:Area/Main/Exploration">000377</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Spectrum structures and biological functions of 8-mers in the human genome.</title>
<author><name sortKey="Jia, Yun" sort="Jia, Yun" uniqKey="Jia Y" first="Yun" last="Jia">Yun Jia</name>
<affiliation wicri:level="1"><nlm:affiliation>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China; College of Science, Inner Mongolia University of Technology, Hohhot 010051, China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China; College of Science, Inner Mongolia University of Technology, Hohhot 010051</wicri:regionArea>
<wicri:noRegion>Hohhot 010051</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Li, Hong" sort="Li, Hong" uniqKey="Li H" first="Hong" last="Li">Hong Li</name>
<affiliation wicri:level="1"><nlm:affiliation>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China. Electronic address: ndlihong@imu.edu.cn.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021</wicri:regionArea>
<wicri:noRegion>Hohhot 010021</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Wang, Jingfeng" sort="Wang, Jingfeng" uniqKey="Wang J" first="Jingfeng" last="Wang">Jingfeng Wang</name>
<affiliation wicri:level="1"><nlm:affiliation>College of Science, Inner Mongolia University of Technology, Hohhot 010051, China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>College of Science, Inner Mongolia University of Technology, Hohhot 010051</wicri:regionArea>
<wicri:noRegion>Hohhot 010051</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Meng, Hu" sort="Meng, Hu" uniqKey="Meng H" first="Hu" last="Meng">Hu Meng</name>
<affiliation wicri:level="1"><nlm:affiliation>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021</wicri:regionArea>
<wicri:noRegion>Hohhot 010021</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Yang, Zhenhua" sort="Yang, Zhenhua" uniqKey="Yang Z" first="Zhenhua" last="Yang">Zhenhua Yang</name>
<affiliation wicri:level="1"><nlm:affiliation>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021</wicri:regionArea>
<wicri:noRegion>Hohhot 010021</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series><title level="j">Genomics</title>
<idno type="eISSN">1089-8646</idno>
<imprint><date when="2019" type="published">2019</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Computational Biology</term>
<term>CpG Islands</term>
<term>Genome, Human</term>
<term>Humans</term>
<term>Nucleosomes (metabolism)</term>
<term>Nucleotide Motifs</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr"><term>Analyse de séquence d'ADN</term>
<term>Biologie informatique</term>
<term>Génome humain</term>
<term>Humains</term>
<term>Ilots CpG</term>
<term>Motifs nucléotidiques</term>
<term>Nucléosomes (métabolisme)</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="metabolism" xml:lang="en"><term>Nucleosomes</term>
</keywords>
<keywords scheme="MESH" qualifier="métabolisme" xml:lang="fr"><term>Nucléosomes</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Computational Biology</term>
<term>CpG Islands</term>
<term>Genome, Human</term>
<term>Humans</term>
<term>Nucleotide Motifs</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr"><term>Analyse de séquence d'ADN</term>
<term>Biologie informatique</term>
<term>Génome humain</term>
<term>Humains</term>
<term>Ilots CpG</term>
<term>Motifs nucléotidiques</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">The spectra of k-mer frequencies can reveal the structures and evolution of genome sequences. We confirmed that the trimodal spectrum of 8-mers in human genome sequences is distinguished only by CG2, CG1 and CG0 8-mer sets, containing 2,1 or 0 CpG, respectively. This phenomenon is called independent selection law. The three types of CG 8-mers were considered as different functional elements. We conjectured that (1) nucleosome binding motifs are mainly characterized by CG1 8-mers and (2) the core structural units of CpG island sequences are predominantly characterized by CG2 8-mers. To validate our conjectures, nucleosome occupied sequences and CGI sequences were extracted, then the sequence parameters were constructed through the information of the three CG 8-mer sets respectively. ROC analysis showed that CG1 8-mers are more preference in nucleosome occupied segments (AUC > 0.7) and CG2 8-mers are more preference in CGI sequences (AUC > 0.99). This validates our conjecture in principle.</div>
</front>
</TEI>
<affiliations><list><country><li>République populaire de Chine</li>
</country>
</list>
<tree><country name="République populaire de Chine"><noRegion><name sortKey="Jia, Yun" sort="Jia, Yun" uniqKey="Jia Y" first="Yun" last="Jia">Yun Jia</name>
</noRegion>
<name sortKey="Li, Hong" sort="Li, Hong" uniqKey="Li H" first="Hong" last="Li">Hong Li</name>
<name sortKey="Meng, Hu" sort="Meng, Hu" uniqKey="Meng H" first="Hu" last="Meng">Hu Meng</name>
<name sortKey="Wang, Jingfeng" sort="Wang, Jingfeng" uniqKey="Wang J" first="Jingfeng" last="Wang">Jingfeng Wang</name>
<name sortKey="Yang, Zhenhua" sort="Yang, Zhenhua" uniqKey="Yang Z" first="Zhenhua" last="Yang">Zhenhua Yang</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000377 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000377 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= MersV1 |flux= Main |étape= Exploration |type= RBID |clé= pubmed:29522801 |texte= Spectrum structures and biological functions of 8-mers in the human genome. }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i -Sk "pubmed:29522801" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd \ | NlmPubMed2Wicri -a MersV1
This area was generated with Dilib version V0.6.33. |